Finding Global Optimum for Truth Discovery: Entropy Based Geometric Variance
Abstract
Truth discovery is an important problem arising in data-analytics-related fields such as data mining, databases, and big data. It concerns finding the most trustworthy information from a dataset acquired from a number of unreliable sources. Due to its importance, the problem has been extensively studied in recent years, and a number of techniques have already been proposed. However, all of them are heuristic in nature and do not have any quality guarantee. In this paper, we formulate the problem as a high-dimensional geometric optimization problem, called Entropy based Geometric Variance. Relying on a number of novel geometric techniques (such as LogPartition and Modified Simplex Lemma), we further discover new insights into this problem. We show, for the first time, that the truth discovery problem can be solved with a guaranteed quality of solution. In particular, we show that it is possible to achieve a (1 + ε)-approximation within nearly linear time under some reasonable assumptions. We expect that our algorithm will be useful for other data-related applications.
1998 ACM Subject Classification: F.2.2 Nonnumerical Algorithms and Problems, Geometrical problems and computations
Related resources
A New Method for Root Detection in Minirhizotron Images: Hypothesis Testing Based on Entropy-Based Geometric Level Set Decision
In this paper, a new method is introduced for root detection in minirhizotron images for root investigation. First, a hypothesis testing framework is defined to separate roots from background and noise. Then the correct roots are extracted by using an entropy-based geometric level set decision function. Performance of the proposed method is evaluated on real captured images in tw...
Exploring Relevance as Truth Criterion on the Web and Classifying Claims in Belief Levels
The Web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the Web. Moreover, different websites often provide conflicting information on a subject. Several truth discovery methods have been proposed for various scenarios, and they have been successfully applied in diverse application domains. In this paper...
A Level-Value Estimation Algorithm and Its Stochastic Implementation for Global Optimization
In this paper, we propose a new method for finding the global optimum of continuous optimization problems, namely the Level-Value Estimation algorithm (LVEM). First we define the variance function v(c) and the mean deviation function m(c) with respect to a single variable (the level value c); both of these functions depend on the optimized function f(x). We verify these functions have some good prop...
An Entropy-Based Position Projection Algorithm for Motif Discovery
The motif discovery problem is crucial for understanding the structure and function of gene expression. Over the past decades, many attempts at motif finding using consensus and probability training models have been successful. However, most existing motif discovery algorithms are still time-consuming or easily trapped in a local optimum. To overcome these shortcomings, in this paper, we propose an e...
Real Time Object Detection and 3D Modeling Using Fuzzy Logic
This paper, OD3DM (Object Detection and 3D Modeling), mainly discusses the process of detecting complex geometric objects and then performing 3D modeling of them using entropy-based selection of the optimum transformation of input data, wavelet-based transformation, and fuzzy logic techniques for designing and training object recognition systems using realistic 3D computer graphics m...
Publication date: 2016